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nucleotide 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
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Predicted end 
nucleotide 
location 
corresponding 
to first 
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residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine / 
H=Histidine, I=Isoleucine, K= Lysine, 
L=Leucine , M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V^Valine, 
W= Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ASNYNVPLS STAQSTSARNSDS KLTWSPGSVTNTSLAHELWKVP 
LPPKNITAPSRPPPGLTGQKPPLSTWDNSPLRIGGGWGNSDARY 
T PGS SWGE SSSGRI TNWLVLKNLTP Q IDGSTLRTLCMQHGPL I T 
FHLNLPHGNALVRYSSKEEWKAQKSLHISDLFLLTL 


7117 


595 


1261 


LLISTPGGCHPPPSSIE F T YTGAWG KAL P AP HM P CAPGAL P QGA 
FVSQAARAIPLLQPSQAAQAEGLSQPARACGALCSLPWPLRNWG 
SPILRLPGGLRTPTNDRKTRTRSAMACWARAQWDTLGPLKLSHR 
GKVCLRHPRPTGVRGGPGAAGRQGGMGTRRRGTFTSGARDP3GL 


7118 


49 


1863 


P H C E PNPG AG AM VL LH VL F EHAVG Y ALLALKE VE E I S LL Q P Q VE "' 
E SVLNLGKFHS I VRLVAF CPFAS S Q VALE NANAVS EGWHEDLR 
L L LE THL P S KKKKVLLGVGDPKI GAAIQE ELG YNCQTGGV I AE I 
LRGVRLHFHNLVKGLTDLSACKAQLGLGHSYSRAKVKFNVNRVD 
NMIIQSISLLDQLDKDINTFSMRVREWYGYHFPELVKIINDNAT 
YCRLAQFIGNRRELNEDKLEKLEELTMDGAKAKAILDASRSSMG 
MDISAIDLINIESFSSRWSLSEYRQSLHTYLRS KMSQVAPSLS 
AL1 GEAVGARL I AHAG S LTNLAK Y P AS TVQ I LG AE KAL FRALKT 
RGNTPKTGLIFHSTFIGRAAAKNKGRISRYLAErKCSIASRIDCF 
SEVPTSVFGEKLREQVEERLSFYETGEIPRKNLDVMKEAMVQAE 
EAAAE I TRKLE KQEKKRLKKEKKRLAALALASS E>jTSS S TPEE CE 
EMSEKPKKKKKQKPQEVPQENGMEDPSISFSKPKKKKSFSKEEL 
MSSDLEETAGSTSIPKRKKSTPKEETVNDPEEAGHRSGSKKKRK 


7119 


49 


1863 


PHCEPNPGAGAMVLLHVLFEHAVGYALLALKEVEEISLLQPQVE 
ES VLNLGKFHS I VRLVAFCP FAS S QVALENANAVSEGWHEDLR 
LLL E THLP S KKKKVLLG VGD P K I GAA I Q EE LG YN CQTGG VI AE I 
LRGVRLHFHNLVKGLTDLSAGKAQLGLGHSYSRAKVKFNVNRVD 
NMIIQSISLLDQLDKDINTFSMRVREWYGYHFPELVKIINDNAT 
YCRLAQFIGNRRELNEDKLEKLEELTMDGAKAKAILDASRSSMG 
MD I SAIDL INI ESFSSRWSLSEYRQSLHTYLRS KMSQVAPSLS 
AL I GEAVGARL I AHAG S LTNLAK Y PAS TVQ I LGAE KAL FRALKT 
RGNT P K YGL I FHS T F I GRAAAKNKGR I S RYLAN KCS I ASR I D CF 
SEVPTSVFGEKLREQVEERLS FYETGE I PRKNLDVMKE AMVQAE 
EAAAE I TR KL E KQE KKRL KKE KKRLAALALAS S ENS S S TP E E CE 
EMSEKPKKKKKQKPQEVPQENGMEDPSISFSKPKKKKSFSKEEL 
MSSDLEETAGSTS I PKRKKSTPKEETVNDPEEAGHRSGSKKKRK 
FSKEEPVSSGPEEAAGKSSSKKKKKFHKASQED 


7120 


1991 


64 


OT iHTP. R PT iPfTn'K'^/T'MZiMnnFT .VTTvTTiTr DP TTTTT "DOT A "KIT rTTVTiC 
Vi-LJu x x\.ivu,j_ixv\juiv v J.lNni'lVii-'i. i-1 V ' im i i Pf t: nc J. ill if \f Xjr\vi LjO V vr JV1JO 

NSTTPL I FVLSPGTDPAADIiYKFAEEMKFSKKLSAI SLGQGQGP 
RAEAMMRSSIERGKWVFFQNCHLAPSWMPALERLIEHINPDKVH 
RDFRLWLTSLPSNKFPVSILQNGSKMTIEPPRGVRANLLKSYS3 
LGEDF LN S CH KVME F KS LL LS LCL FHGNALERR KFG P LG FN I P Y 
EFTDGDLRICISQLKMFLDEYDDIPYKVLKYTAGEINYGGRVTD 
DWDRRCIMNILEDFYNPDVLSPEHSYSASGIYHQIPPTYDLHGY 
LSYIKSLPLNDMPEIFGLHDNANITFAQNETFALLGTIIQLQPK 
SSSAGS OGREE I VEDVTQNI LLKVPEP INLQWVMAKYP VL YEES 
MNTVLVQEVIRYNRLLQVITQTLQDLLKALKGLWMSSQLELMA 
ASLYNNTVPELWSAKAYPSLKPLSSWVMDLLQRLDFLQAWIQDG 
IPAVFWISGFFFPQAFLTGTLQNFARKFVISIDTISFDFKVMFE 
APSELTQRPQVGCYIHGLFLEGARWDPEAFQLAESQPKELYTEM 
AVIWLLPTPNRKAQDQDFYLCPIYKTLTRAGTLSTTGHSTNYVI 
AVE I PTHQ PQRHW I KRGVAL I CALDY 


7121 


2 


546 


RPLRPWVLSLGSMVGLMTYGRRQFQSLDTTMRRLIPPFREASAK 
LTTLVDADAEAFTAYLEAMRLP KNT P EE KDRRTAALQEGLRRAV 
SVPLTLAETVASLWPALQELARCGNLACRSDLQVAAKALEMGVF 
GAYFNVLINLRDITDEAFKDQIHHRVSSLLQEAKTQAALVLDCL 



599 



WO 01/53312 



PCT7US00/34263 



SEQ 
ID 
NO: 


Predicted 

Jw/^M JLlllJL Xliy 

nucleotide 
location 
corresponding 
to first . 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine , D=Aspart ic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine , M=Methionine , N=Asparagine, 
P=Proline , Q=Glutamine , R=Arginine , 
S=Serine, T=Threonine, V=Valine, 
W= Tryptophan, Y*=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\-possible nucleotide insertion} 










7122 


2 


546 


RPLRPWLSLGSMVGLMTYGRRQFQSLDTTMRRLIPPFREASAK 
LTTLVDADAEAFTAYLEAMRLPKNTPEEKDRRTAALQEGLRRAV 
0 v JrL>i LiAbi l VAbJjWPAJbyiiljARCC?N-jACKSDLQVAAKALEM3VF 
GAYFNVLINLRDITDEAFKDQIHHRVSSLLQEAKTQAALVLDCL 
ETRQE 


7123 


1 


1092 


KPAVPEARSAGTSEAGRSGAEEVSCGSVSGDGAAMRLTPRALCS 
AAQAAWRENFPLCGRDVARWFPGHMAKGLKKMQSSLKLVDGIIE 
VHDARIPLSGRNPLFQETLGLKPHLLVLNKMDLADLTEQQKIMQ 
HLEGEGLKNVI F TNC VKD ENVKQ I I PMVTEL I GR3HRYHRKENL 
EYCIMVIGVPNVGKSSLINSLRRQHLRKGKATRVGGEPGITRAV 
MSKIQVSERPLMFLLDTPGVLAPRIESVETGLKLALCGTVLDHL 
VGEETMADYLLYTLNKHQRFGYVQHYGLGSACDNVERVLKSVAV 
KLGKTQKVKVLTGTGNVWVIQPNYPAAARDFLQTFRRGLLGSVM 
LDLDVLRGHPRV 


7124 


2 


382 


LPLTLLLAAPFAHLLLPPGHDQSPCWHPGPALSPGTLGPLSWAM 
ANSGLQLLGYFLALGGWVGI1ASTALPQWKQSSYAGDASIQLRS 
KVFVLESEWGGDSLGLPRDCGWSCLLHSAVRSEKGFWS 


7125 


166 


1127 


NCISEKRNYSFSMQKGKGRTSRIRRRKLCGSSESRGVNESHKSE 
FIELRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIIS 
LPESCLLT\RDTVIRSYLGAYITKWKPPPSPLLALCTFLVSEKH 
AGHRSLLEA\YLEILPKAYTCPVCLEPEWNLLPKSLKAKAEEQ 
RAHVQEFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNT 
RAVYL\ S PGSGNAFLQSRTPVQLAP YLDLLNHS PHVQVKAAFNE 
ETHS YE I RT T S RWRKHE E VF I C YG PHDNQRL FLE YG F VS VHWPH 

V x V oKLiWlNyjjC-o 


7126 


1 


733 


CRDMAAFIVPSPARRCSQKGSLGHLPTQPWLWAAMSPRGQERGT 
SHSQAREPQRPGRWLLGSLQSSPGTLGQAGTASRRRGCMVQRWV 
QVATGRRAVQVP KGALGLALGETS PGASRGMSGGAGGCWALGWA 
PSPVLPSWLLEGPPPWLSIISDSGTQRPSPRRCPARPSPWGPQC 
WRGGR I AS AEAS ST* TPGS GSRARSGRRS PGS RRRS ASAP S PT P 
PTDACA* SCVARPAGSRSSRPAAA 


7127 


1311 


277 


GLPAMCST*KAGYYEETEGDCIPKDR*IEKRPFKEI A "RRIPRIF 
AKQKQI*S*NSQKIGASEIDRGRKEADCSDAPAAARIGAVSVFR 
RSTQE ARVS PRSNAKS ANLRAVRAD* WEHF VLL FHTP EQFLAE C 
ICRST* *K*WHQLC* PLSSL*TGLKRKLLL*VLFRI *WLKDCDV 
* KCQKI FATNFCNWQNL I Q * EE * KPVEYSVEN* H I MNLLLPM * L 
CQSSLRDQTIVTWRM*RNYSMFRINMISSL*DGSIHIPLKLHFY 
PALIFTLTVPINSCCQRPLPLFAHQSIKTLASSGSPMLACLRFL 
LVKKRAFIHTPRSPGCSV*CKHVLVKDNKNNCVGSEV 


7128 


2 


5228 


GRVDLWTILLGRSALRELSQIEAELNKHWRRLLEGLSYYKPPSP 
SSAEKVKANKDVASPLKELGLRISKFLGLDEEQSVQLLQCYLQE 
DYRGTRDSVKTVLQDERQSQALILKIADYYYEERTCILRCVLHL 
LTYFQDERHPYRVEYADCVDKLEKELVSKYRQQFEELYKTEAPT 
WETHGNLMTERQVSRWFVQCLREQSMLLEIIFLYYAYFEMAPSD 
LLVLTKMFKEQGFGS RQTNRHLVDETMD P FVDR I G Y FSAL I LVE 
GMDIESLHKCALDDRRELHQFAQDGLICQDMDCLMLTFGDIPHH 
APVLLAWALLRHTLNPEETSSWRKIGGTAIQLNVFQYLTRLLQ 
SLASGGNDCTTSTACMCVYGLLSFVLTSLELHTLGNQQDIIDTA 
CEVLADPSLPELFWGTEPTSGLGIILDSVCGMFPHLLSPLLQLL 
RAL VSGKS TAKKVYS FLDKMS FYNEL YKHKPHDVISHEDGTL WR 
RQTPKLLYPLGGQTNLRI PQGTVGQVMLDDRAYLVRWE YSYS SW 
TLFTCEIEMLLHWSTADVIQHCQRVKPIIDLVHKVISTDLSIA 
DCLLPITSRIYMLLQRLTTVISPPVDVIASCVNCLTVIiAARNPA 
KVWTDLRHTGFLPFVAHPVSSLSQMISAEGMNAGGYGNLLMNSE 
QPQGEYGVTIAFLRLlTTLVKGQLGSTQSQGIiVPCVMFVLKEML 
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amino acid 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, [^Methionine, N=Asparagine, 
P^Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








PSYHKWRYNSHGVREQIGCLILELIHAILNLCHETDLHSSHTPS 
LQFLCICSLAYTEAGQTVINIMGIGVDTIDMVMAAQPRSDGAEG 
QGQGQLL I KTVKLAFS VTNNV I RLKPPSNWS PLEQALSQHGAH 
GNNLIAVLAKYIYHKHDPALPRLAIQLLKRLATVAPMSVYACLG 
NDAAAIRDAFLTRLQSK\IE\DMRIK\VMIL\EFLTVA\VETQP 
GLIELFLNLEVKDG\SDGSKEFSLGMW\SCLHAV/VWEL1DSQQ 
QDRYWCPPLLHRAAIAFLHALWQDRRDSAMLVLRTKPKFWENLT 
SPLFGTLSPPSETSEPSILETCAL1MKIICLEIYYWKGSLDQP 
LKDTLKKFS1EKRFAYWSGYVKSLAVHVAETEGSSCTSLLEYQM 
LVSAWRMLLIIATTHADIMHLTDSWRRQLFLDVLDGTKALLLV 
PASVNCLRLGSMKCTLLLILLRQWKRELGSVDEILGPLTEILEG 
VLQADQQLMEKTKAKVFSAF I TVLQMKEMKVSD I PQYSQLVLNV 
CETLQEEVIALFDQTRHSLALGSATEDKDSMETDDCSRSRHRDQ 
RDGVCVLGLHLAKELCEVDEDGDSWLQVTRRLPILPTLLTTLEV 
S LRM KQNLH FTE AT LHLLLT LARTQQGATAVAGAG I TQSICLPL 
LSVYQLSTNGTAQTPSASRKSLDAPSWPGVYRLSMSLMEQLLKT 
LRYNFLPEALDFVGVHQERTLQCLNAVRTVQSLACLEEADHTVG 
FILQLSNFMKEWHFHLPQLMRDIQVNLGYLCQACTSFLHSRKML 
QHYLQNKNGDGLPSAV\AQRV\QRPPSAASAAPSSSKQPAADTE 
ASEQQALHTVQYGLLKILSKTLAALRHFTPDVCQrLLDQSLDLiA 
E YN F L FAL S FT T P T FD S E VAP S FGTLLAT VNVALNM LGE LD KKK 
E PLTQAVGLS TQAEGTRTLKS LLMFTMENCFYLLISQAMRYLRD 
PAVHPRDKQRMKQELSSELSTLLSSLSRYFRRGAPSSPATGVLP 
SPQGKSTSLSKASPESQEPLIQLVQAFVRHMQR 


7129 


1 


1054 


FRR FRWRRRLH * AGPAS SAGGS PGE AS G TMS GELP PN IN I KE PR 
WDQSTFIGRANHFFTVTDPRNILLTNEQLESARKIVHDYRQGIV 
P PGLTENEL WRAKY I YDS AFH PDTGE KM I L I GRMS AQ VPMNMT I 
TGCMMTFYRTTPAVLFWQWINQSFNAWNYTNRSGDAPLTWEL 
GTAYVS AT TGAVAT ALG LNAL TKH VS P L I GRF VP FAAVAAAN C I 
NT PLMRQRELKVGI PVTDENGNRLGESANAAKQAI TQVWSRIL 
MAAPGMAI PPF IMNTLEKKAFLKRFP WMSAP I QVGLVGFCLVFA 
TPLCCALFPQKSSMSVTSLEAELQAKIQESHPELRRVYFNKGL 


7130 


2 


780 


HEVPSLQTSDPLPGSVQRCSVWSQPNKENWCQDHLYNSLGRKG 
ISAKSQPYHRSQSSSSVLINKSMDSINYPSDVGKQQLLSLHRSS 
RCESHQDLLPDIADSHQQGTEKLSDLTLQDSQKWWNRNLPLN 
AQIATQNYFSNFKETDGDEDDYVEIKSEEDESELELSHNRRRKS 
DSKF VDADFSDNVCSGNTLHSLNS PRTPKKPVNS KLGLS PYLTP 
YNDS DKLND YLWRG P S PNQQNI VQS LRE KFQCLS S S S FA 


7131 


805 


573 


AAAEGH I EWKFL I EACKVWPFAKDRWGNI PLDDAVQFNHLE W 
KLLQD YQDS YTLS E TQAE AAAE ALS KEKLES MV 


7132 


1420 


1087 


IDMLLLSGALVSGPYTLITTAVSADLGTHKSIiKGNAHALSTVTA 
IIDGTGSVGAALGPLLAGLLSPSGWSNVFYMLMFADACALLFLI 
RLIHKELSCPGSATGDQVPFKEQ 


7133 


2 


3648 


QQIPGLLPAHGESGDALRKPRLQKPITGHLDDLFFTLYPSLEKF 
EEELLELHVQDHFQEGCGPLDGGALEILERRLRVGVHNGLGFVQ 
RPQWVLVPEMDVALTRSASFSRKWSSSKTSSGSQALVLRSRL 
RLPEMVGHPAFAVIFQLEYVFSSPAGVDGNAASVTSLSNLACMH 
MVRWAVWNPLLEADSGRVTLPLQGGIQPNPSHCLVYKVPSASMS 
SEEVKQVESGTLRFQFSLGSEEHLDAPTEPVSGPKVERRPSRKP 
PTSPSSPPAPVPRVLAAPQNSPVGPGLSISQLAASPRSPTQHCL 
ARPTSQLPHGSQASPAQAQEFPLEAGISHLEADLSQTSLVLETS 
IAEQLQELPFTPLHAPIWGTQTRSSAGQPSRASMVLLQSSGFP 
EILDANKQPAEAVSATEPVTFNPQKEESDCLQSNEMVLQFLAFS 
RVAQDCRGTSWPKTVYFTFQFYRFPPATTPRLQLVQLDEAGQPS 
SGALTHILVPVSRDGTFDAGSPGFQLRYMVGPGFLKPGERRCFA 
RYLAVQTLQIDVWDGDSLLLIGSAAVQMKHLLRQGRPAVQASHE 
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to first 
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amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, (^Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K= Lysine, 
L=Leucine , M=Methionine , N=Asparagine , 
P=Proline, Q=Glutaraine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion/ 
\-possible nucleotide insertion) 








TiP^AfQ T r P'Y'RO'n7\rM\ArQ<^i'nMT (~ , T?r*'D\r~fT}Tr~<'\T'CJC!\T\TTrr~'TTn ur tit 7> "kt 
jjjl v v j-i± u, x nyuiwiv V o^jJJl v lijijr^jKV Airlta Vrlo v VKoKLHLi i LAN 

VGHPCEQKVRGCSTLPPSRSRVISNDGASRFSGGSLLTTGSSRR 
KHWQAQKLADVDSELAAMLLTHARQGKGPQDVSRESDATRRRK 
LERMRSVRLQEAGGDLGRRGTSVLAQQSVRTQHLRDLQVIAAYR 
ERT KAES I AS L L S LAI T TEHTLHATLGVAEFFE F VL KNPHNTQH 
TVTVEIDNPELSVIVDSQEWRDFKGAAGLHTPVEEDMFHLRGSL 
APQLYLRPHETAHVPFKFQSFSAGQLAMVQASPGLSNEKGMDAV 
S PWKS SAVPTKHAKVLFRASGGKP I AVLCLTVELQFHWDQVFR 

c mr jujop XjxvrV-rtJLKljlr Jr Wri 1 r irvj/lF V oL v ll_i<j.fc!i JJ P P VHVRL.iDL)PNV 
ICETQNVGPGEPRDIFLKVASGPSPEIKDFFVIIYSDRWLATPT 
QTWQVYLHSLQRVDVSCVAGQLTRLSLVLRGTQTVRKVRAFTSH 
P QE L KTD P KGVF VL P P RG VQD LH VGVR P LRAG S R F VHLNL VDVD 
CHQLVAS WLVCLCCRQPL I SKAFE IMLAAGEGKGVNKRI T YTNP 
YPSRRTFHLHSDHPELLRFREDSFQVGGGETYTIGLQFAPSQRV 
GEEEILIYINDHEDKNEEAFCVKVIYQ 


7134 


2115 


1111 


GGEGFSYPPHVGLSLGTPLDPHYVLLEVHYDNPTYEEGLIDNSG 
LRLFYTMDIRKYDAGVIEAGLWVSLFHTIPPGMPEFQSEGHCTL 
E C LE E ALE AE KP S G I H VF AVL LHAHL AGRG I R LRHFR KG KE MKL 
LA YD DD FD FN FQEFQYLKEEQTILP GDNL I T E CR YNTKDRAEMT 
WGGLSTRSEMCLS YLLYYPRINLTRCAS I PD IMEQLQF IGVKEI 
YRPVTTWPFI I KS P KQYKNLS FMDAMNKFKWT KKEGL S FNKLVL 
SLPVNVRCSKTDNAEWSIQGMTALPPDIERPYKAEPLVCGTSSS 
SSLHRDFS INLLVCLLLLS CTLSTKS L 


7135 


2 


2072 


FVPRVTPRSLSLQGPKGESVGSITQPLPSSYLIFRAASESDGRC 
1/yijiJAJLihjJjLB.ljKCfebijbRLGl LKPCjRDGEPGTS PDASPSSLCGLPA 
SATVHPDQDLFPLNGSSLENDAFSDKSERENPEESDTETQDHSR 
KTESGSDQSETPGAPVRRGTTYVEQVQ2ELGELGEASQVETVSE 

ENKSLMWTLLKQLRPGMDLSRWLPTFVLEPRSFLNKLSDYYYH 

AnTiTiC!p a a^jTTTrna VQPM"K"TArT,T?rATVT a r* ^vt/'T/DXT r I'l/nvMn t t r* 
lAuiJij&isjri^v c,n±j±\i. ot\.l y iJ\Jj V JjKH jlJjovtJ I l\J\Jr J_ AKJr Y£J JrlLG 

ETFRCCWFHPQTDSRTFYIAEQVSHHPPVSAFHVSNRKDGFCIS 
GSITAKSRFYGNSLSALLDGKATLTFLNRAEDYTLTMFYAHCKG 
ILYGTMTLELGGKVTIECAKWNFQAQLEFKLKPFFGGSTSINQI 
SGKITSGEEVLASLSGHWDRDVFIKEEGSGSSALFWTPSGEVRR 
QRLRQHTVPLE E QTE LES ERLWQHVTRAI S KGDQHRATQE KFAL 
EEAQRQRARERQESLMPWKPQLFHLDPITQEWHYRYEDHSPWDP 
LKDIAQFEQDGILRTLQQEAVARQTTFLGSPGPRHERSGPDQRL 
RKASDQPSGHSQATESSGSTPESCPELSDEEQDGDFVPGGESPC 
PRCRKEARRLQALHEAILSIREAQQELHRHLSAMLSSTARAAQA 
PTPGLLQSPRSWFLLCVFLACQLFINHILK 


7136 


2 


41B 


DFVPSFRRPSGNTSQTVWLLRAATLEKEVAGLREKIHHLDDMLK 
S QQRKVRQM IEQLQNS KAV I QS KDAT I QELKEKIAYLEAENLEM 
HDRMEHLIEKQISHGNFSTQARAKTENPGSIRISKPPSPKPMPV 
IRWET 


7137 


2 


466 


WASGMSTVPGGSRHSLGIQVRGGWGVTGGEEESLTVPVADTWQA 
GSFKVATQERNPQRAQMRLRRQKKGWPFLGDFLTELQRLDSAI 
PDDLDGNTNKRSKEVRVLQEMQLLQVAAMWYRLRPLEKFVTYFT 
RMEQLSDKESYKLSCQLEPENP 


713 8 


2 


466 


WAS GMS T V PGGS RH S LG I Q VRGGWG VTGG EE E SLT VP VADTV7Q A 
GSFKVATQERNPQRAQMRLRRQKKGWPFLGDFLTELQRLDSAI 
PDDLDGNTNKRSKEVRVLQEMQLLQVAAMNYRLRPLEKFVTYFT 
RMEQLSDKESYKLSCQLEPENP 


7139 


1 


357 


S LRNS ARGLKMAASAARGAAALRRS INQ PVAFVRR I PWTAAS S Q 
LKEHFAQFGHVRRCILPFDKETGFHRGLGWVQFSSEEGLRNALQ 
QENHIIDGVKVQVHTRRPKLPQTSDDEKKDF 


7140 


1401 


1357 


RASSLQVLKAWGGLIPSSFQQQHTGQYALEELFDLKVYDCFCSF 
NMNVSLEKQLRPSQPWPRGKCRKTPGWEEARPKAQDLRGDLGKT 
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Predicted end 
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amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q^Glutamine, R=Arginine, 
S^Serine , T=Threonine , V=Val ine , 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








QAGPAEAHTRGPPRLPAATGCPPHLPGLLSGISVDIDPTGLQSQ 
WTPKGQDPPLMFSEDYQKSLLEQYHLGLDQKLRKYWGELIWNF 
ADFMTNQCG 


7141 


124 


1073 


LDSRSCWLDMEDLEEDVRFIVDETLDFGGLSPSDSREEEDITVL 
VTPEKPLRRGLSHRSDPNAVAPAPQGVRLSLGPLSPEKLEE1LD 
EANRLAAQLEQCALQDRESAGEGLGPRRVKPSPRRETFVLKDSP 
VRDLLPTVNSLTRSTPS /LKQPDASTPE * * * EGVSQGS PGYI WK 
EALQHEEGVTHLQSVPCIQKPSIFSS\SRSTPPVRGRAGPSGRA 
AASEETRAAKLRGAAAKSSCQLPIPSAIPRPASRMPLTSRSVPP 
GRGALPPDSLSTRKGLPRPSTAGHRVRESGHKVPVSQRLNLPVM 
GATRSNLQPP 


7142 


658 


839 


LIFLMLHMELKMLSSVTLHIRAFLYWICLKPTSCLIFQNVLNLL 
KK*SRAVGVWVMCRT/YSSDLQVGVIKPWLLLGSQDAAHDLDT 
LKKNKVTHILNVAYGVENAFLSDFTYKSISILDLPETNILSYFP 
ECFEFIEEAKRKDGWLVHCNA 


7143 


3 


773 


SLEMSSDGEPLSRMDSEDSISSTIMDVDSTISSGRSTPAMMNGQ 
GSTTSSSKNIAYNCCWDQCQACFNSSPDLADHIRSIHVDGQRGG 
VFVCLWKGCKVYNTPSTSQSWLQRHMLTHSGDKPFKCWGGCNA 
SFASQGGLARHVPTHFSQQNSSKVSSQPKAKEESPSKAGMNKRR 
KLKNKRRRSLARPHDFFDAQTLDAIRHRAICFNLSAHIESLGKG 
HSWFHSTVSILLFFQIKYKTLQKNISTIISKSLKI 


7144 


1 


988 


FRVNMQDGGPSPAEHSKAEESAGMEARFLGLPDAAGSSGPTPAR 
R CPAPR PAG VS Y VI RDE VE K YNRNG VNALQLD PALNRL F TAGRD 
SIIRIWSVNQHKQDPYIASMEHHTDWVNDIVLCCNGKTLISASS 
DTTVKVWNAHKGFCMSTLRTHKDYVKALAYAKDKELVASAGLDR 
QIFLWDVNTLTALTASNNTVTTSSLSGNKDSIYSLAMNQLGTII 
VSGSTEKVLRVWDPRTCAKLMKLKGHTDNVKALLLNRDGTQCLS 
GSSDGTIRLWSLGQQRCIATYRVHDEGVWALQVNDAFTHVYSGG 
RDRKI YCTDLRNPD I RVLI CE 



TRADOCS: 1 4 1 6260. 1 (%CSK0 1 ! .DOC) 



603 



WO 01/53312 



PCT7US00/34263 



WHAT IS CLAIMED IS: 

1. An isolated polynucleotide comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NO:l-1786 and 3573-5358, a mature protein coding portion 
of SEQ ID NO:l-1786 and 3573-5358, an active domain of SEQ ID NO:M786 and 
3573-5358, and complementary sequences thereof. 

2. An isolated polynucleotide encoding a polypeptide with biological activity, 
wherein said polynucleotide hybridizes to the polynucleotide of claim 1 under stringent 
hybridization conditions. 

3. An isolated polynucleotide encoding a polypeptide with biological activity, 
wherein said polynucleotide has greater than about 90% sequence identity with the 
polynucleotide of claim 1 . 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 

7. An expression vector comprising the polynucleotide of claim 1. 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 
operatively associated with a regulatory sequence that modulates expression of the 
polynucleotide in the host cell. 

1 0. An isolated polypeptide, wherein the polypeptide is selected from the group 
consisting of: 



604 



hV,y. ' vv . p -• ■ T :•: ns :::: 

WO 01/53312 PCT/US00/34263 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 

(b) a polypeptide encoded by a polynucleotide hybridizing under stringent 
conditions with any one of SEQ ID NO: 1-1786 and 3573-5358. 

11. A composition comprising the polypeptide of claim 10 and a carrier. 

12. An antibody directed against the polypeptide of claim 10. 

•13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a 
complex with the polynucleotide of claim 1 for a period sufficient to form the complex; 
and 

b) detecting the complex, so that if a complex is detected, the 
polynucleotide of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions 
with nucleic acid primers that anneal to the polynucleotide of claim 1 under such 
conditions; 

b) amplifying a product comprising at least a portion of the 
polynucleotide of claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in 

the sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the 
method further comprises reverse transcribing an annealed RNA molecule into a cDNA 
polynucleotide. 

16. A method for detecting the polypeptide of claim 10 in a sample, comprising: 
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a) contacting the sample with a compound that binds to and forms a 
complex with the polypeptide under conditions and for a period sufficient to form the 
complex; and 

b) detecting formation of the complex, so that if a complex formation 
is detected, the polypeptide of claim 10 is detected. 

17. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10 under 
conditions sufficient to form a polypeptide/compound complex; and 

b) detecting the complex, so that if the polypeptide/compound 
complex is detected, a compound that binds to the polypeptide of claim 10 is identified. 

18. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10, in a 
cell, under conditions sufficient to form a polypeptide/compound complex, wherein die 
complex drives expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence 
expression, so that if the polypeptide/compound complex is detected, a compound that 
binds to the polypeptide of claim 10 is identified. 

19. A method of producing the polypeptide of claim 10, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected 
from the group consisting of a polynucleotide sequence of SEQ ID NO: 1-1 786 and 3573- 
5358, a mature protein coding portion of SEQ ID NO:l-1786 and 3573-5358, an active 
domain of SEQ ID NO:l-1786 and 3573-5358, complementary sequences thereof and a 
polynucleotide sequence hybridizing under stringent conditions to SEQ ID NO: 1-1786 
and 3573-5358, under conditions sufficient to express the polypeptide in said cell; and 

b) isolating the polypeptide from the cell culture or cells of step (a). 
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20. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of any one of the polypeptides SEQ ID NO: 1787 -3572 and 5359-7144, 
the mature protein portion thereof, or the active domain thereof 

21. The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide 
array. 

22. A collection of polynucleotides, wherein the collection comprising the sequence 
information of at least one of SEQ ID NO:l-1786 and 3573-5358. 

23. The collection of claim 22, wherein the collection is provided on a nucleic acid 
array. 

24. The collection of claim 23, wherein the array detects full-matches to any one of 
the polynucleotides in the collection. 

25. The collection of claim 23, wherein the array detects mismatches to any one of 
the polynucleotides in the collection. 

26. The collection of claim 22, wherein the collection is provided in a computer- 
readable format. 

27. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising a polypeptide of claim 10 or 20 
and a pharmaceutical^ acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising an antibody that specifically 
binds to a polypeptide of claim 10 or 20 and a pharmaceutical^ acceptable carrier. 
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